41 research outputs found
Recommended from our members
Pluripotency factors functionally premark cell-type-restricted enhancers in ES cells.
Enhancers for embryonic stem (ES) cell-expressed genes and lineage-determining factors are characterized by conventional marks of enhancer activation in ES cells1-3, but it remains unclear whether enhancers destined to regulate cell-type-restricted transcription units might also have distinct signatures in ES cells. Here we show that cell-type-restricted enhancers are 'premarked' and activated as transcription units by the binding of one or two ES cell transcription factors, although they do not exhibit traditional enhancer epigenetic marks in ES cells, thus uncovering the initial temporal origins of cell-type-restricted enhancers. This premarking is required for future cell-type-restricted enhancer activity in the differentiated cells, with the strength of the ES cell signature being functionally important for the subsequent robustness of cell-type-restricted enhancer activation. We have experimentally validated this model in macrophage-restricted enhancers and neural precursor cell (NPC)-restricted enhancers using ES cell-derived macrophages or NPCs, edited to contain specific ES cell transcription factor motif deletions. DNA hydroxyl-methylation of enhancers in ES cells, determined by ES cell transcription factors, may serve as a potential molecular memory for subsequent enhancer activation in mature macrophages. These findings suggest that the massive repertoire of cell-type-restricted enhancers are essentially hierarchically and obligatorily premarked by binding of a defining ES cell transcription factor in ES cells, dictating the robustness of enhancer activation in mature cells
Recommended from our members
Allele-specific NKX2-5 binding underlies multiple genetic associations with human electrocardiographic traits.
The cardiac transcription factor (TF) gene NKX2-5 has been associated with electrocardiographic (EKG) traits through genome-wide association studies (GWASs), but the extent to which differential binding of NKX2-5 at common regulatory variants contributes to these traits has not yet been studied. We analyzed transcriptomic and epigenomic data from induced pluripotent stem cell-derived cardiomyocytes from seven related individuals, and identified ~2,000 single-nucleotide variants associated with allele-specific effects (ASE-SNVs) on NKX2-5 binding. NKX2-5 ASE-SNVs were enriched for altered TF motifs, for heart-specific expression quantitative trait loci and for EKG GWAS signals. Using fine-mapping combined with epigenomic data from induced pluripotent stem cell-derived cardiomyocytes, we prioritized candidate causal variants for EKG traits, many of which were NKX2-5 ASE-SNVs. Experimentally characterizing two NKX2-5 ASE-SNVs (rs3807989 and rs590041) showed that they modulate the expression of target genes via differential protein binding in cardiac cells, indicating that they are functional variants underlying EKG GWAS signals. Our results show that differential NKX2-5 binding at numerous regulatory variants across the genome contributes to EKG phenotypes
Single nucleotide polymorphisms at the TRAF1/C5 locus are associated with rheumatoid arthritis in a Han Chinese population
<p>Abstract</p> <p>Background</p> <p>Genetic variants in <it>TRAF1C5 </it>and <it>PTPN22 </it>genes have been shown to be significantly associated with arthritis rheumatoid in Caucasian populations. This study investigated the association between single nucleotide polymorphisms (SNPs) in <it>TRAF1/C5 </it>and <it>PTPN22 </it>genes and rheumatoid arthritis (RA) in a Han Chinese population. We genotyped SNPs rs3761847 and rs7021206 at the <it>TRAF1/C5 </it>locus and rs2476601 SNP in the <it>PTPN22 </it>gene in a Han Chinese cohort composed of 576 patients with RA and 689 controls. The concentrations of anti-cyclic citrullinated peptide antibodies (CCP) and rheumatoid factor (RF) were determined for all affected patients. The difference between the cases and the controls was compared using <it>χ</it><sup>2 </sup>analysis.</p> <p>Results</p> <p>Significant differences in SNPs rs3761847 and rs7021206 at <it>TRAF1/C5 </it>were observed between the case and control groups in this cohort; the allelic p-value was 0.0018 with an odds ratio of 1.28 for rs3761847 and 0.005 with an odds ratio of 1.27 for rs7021206. This significant association between rs3761847 and RA was independent of the concentrations of anti-CCP and RF. No polymorphism of rs2476601 was observed in this cohort.</p> <p>Conclusions</p> <p>We first demonstrated that genetic variants at the <it>TRAF1/C5 </it>locus are significantly associated with RA in Han Chinese, suggesting that <it>TRAF1/C5 </it>may play a role in the development of RA in this population, which expands the pathogenesis role of <it>TRAF1/C5 </it>in a different ethnicity.</p
Optimizing Multi-Objective Federated Learning on Non-IID Data with Improved NSGA-III and Hierarchical Clustering
Federated learning (FL) can tackle the problem of data silos of asymmetric information and privacy leakage; however, it still has shortcomings, such as data heterogeneity, high communication cost and uneven distribution of performance. To overcome these issues and achieve parameter optimization of FL on non-Independent Identically Distributed (non-IID) data, a multi-objective FL parameter optimization method based on hierarchical clustering and the third-generation non-dominated sorted genetic algorithm III (NSGA-III) algorithm is proposed, which aims to simultaneously minimize the global model error rate, global model accuracy distribution variance and communication cost. The introduction of a hierarchical clustering algorithm on non-IID data can accelerate convergence so that FL can employ an evolutionary algorithm with a low FL client participation ratio, reducing the overall communication cost of the NSGA-III algorithm. Meanwhile, the NSGA-III algorithm, with fast greedy initialization and a strategy of discarding low-quality individuals (named NSGA-III-FD), is proposed to improve the convergence efficiency and the quality of Pareto-optimal solutions. Under two non-IID data settings, the CNN experiments on both MNIST and CIFAR-10 datasets show that our approach can obtain better Pareto-optimal solutions than classical evolutionary algorithms, and the selected solutions with an optimized model can achieve better multi-objective equilibrium than the standard federated averaging (FedAvg) algorithm and the Clustering-based FedAvg algorithm
Federated learning based on stratified sampling and regularization
Abstract Federated learning (FL) is a new distributed learning framework that is different from traditional distributed machine learning: (1) differences in communication, computing, and storage performance among devices (device heterogeneity), (2) differences in data distribution and data volume (data heterogeneity), and (3) high communication consumption. Under heterogeneous conditions, the data distribution of clients varies greatly, which leads to the problem that the convergence speed of the training model decreases and the training model cannot converge to the global optimal solution. In this work, an FL algorithm based on stratified sampling and regularization (FedSSAR) is proposed. In FedSSAR, a density-based clustering method is used to divide the overall client into different clusters, then, some available clients are proportionally extracted from different clusters to participate in training which realizes unbiased sampling for the overall client and reduces the aggregation weight variance of the client. At the same time, when calculating the model local loss function, we limit the update direction of the model by a regular term, so that heterogeneous clients are optimized in the globally optimal direction. We prove the convergence of FedSSAR theoretically and experimentally, and demonstrate the superiority of FedSSAR by comparing it with other FL algorithms on public datasets